Storage Layout and I/O Performance in Data Warehouses
نویسندگان
چکیده
Defining data placement and allocation in the disk subsystem can have a significant impact on data warehouse performance. However, our experiences with data warehouse implementations show that the database storage layout is often subject to vague or even invalid assumptions about I/O performance trade-offs. Clear guidelines for the assignment of database objects to disks are a very common request from data warehouse DBAs and consultants. We review best practices suggested by storage and database vendors, and present two sets of performance measurements that compare storage layout alternatives and their implications. The first set used a TPC-H benchmark workload with DB2 UDB, the other a star schema/star join scenario with IBM Red Brick Data Warehouse.
منابع مشابه
Layout Design of Multiple Blocks Class-Based Storage Strategy Warehouses
Manual order-picking warehouses are highly influenced by its’ layout design. In fact, layout parameters play the most important role in determining the route length which is responsible for the largest share in a warehouse operating costs. Conventional simulation techniques are not capable to capture the complexities in dynamic systems. Therefore, this paper develops an agent-based model to sim...
متن کاملAn EOQ model for non-instantaneous deteriorating items with two levels of storage under trade credit policy
A deterministic inventory model with two levels of storage (own warehouse and rented warehouse) with non-instantaneous deteriorating items is studied. The supplier offers the retailer a trade credit period to settle the amount. Different scenarios based on the deterioration and the trade credit period have been considered. In this article, we have framed two models considering single warehouse ...
متن کاملGong, Zhenhuan. Multi-level Data Layout Optimization for Heterogeneous Access Patterns. (under the Direction of Dr. Nagiza F. Samatova.) Multi-level Data Layout Optimization for Heterogeneous Access Patterns
GONG, ZHENHUAN. Multi-level Data Layout Optimization for Heterogeneous Access Patterns. (Under the direction of Dr. Nagiza F. Samatova.) Recent years have seen an enormous increase in computation power of leadership computing facilities. As a result, huge amounts of data, from terascale to petascale, are being produced by scientific applications running on supercomputers. However, the I/O subsy...
متن کاملEfficient Aggregation Algorithms for Compressed Data Warehouses
ÐAggregation and cube are important operations for online analytical processing (OLAP). Many efficient algorithms to compute aggregation and cube for relational OLAP have been developed. Some work has been done on efficiently computing cube for multidimensional data warehouses that store data sets in multidimensional arrays rather than in tables. However, to our knowledge, there is nothing to d...
متن کاملImproving Throughput for Small Disk Requests with Proximal I/O
This paper introduces proximal I/O, a new technique for improving random disk I/O performance in file systems. The key enabling technology for proximal I/O is the ability of disk drives to retire multiple I/Os, spread across dozens of tracks, in a single revolution. Compared to traditional update-in-place or write-anywhere file systems, this technique can provide a nearly seven-fold improvement...
متن کامل